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Abstract — Construction of high rate Space Time Block Codes 
(STBCs) with low decoding complexity has been studied widely 
using techniques such as sphere decoding and non Maximum- 
Likelihood (ML) decoders such as the QR decomposition de- 
coder with M paths (QRDM decoder). Recently Ren et al., 
presented a new class of STBCs known as the block orthogonal 
STBCs (BOSTBCs), which could be exploited by the QRDM 
decoders to achieve significant decoding complexity reduction 
without performance loss. The block orthogonal property of the 
codes constructed was however only shown via simulations. In 
this paper, we give analytical proofs for the block orthogonal 
structure of various existing codes in literature including the 
codes constructed in the paper by Ren et al. We show that codes 
formed as the sum of Clifford Unitary Weight Designs (CUWDs) 
or Coordinate Interleaved Orthogonal Designs (CIODs) exhibit 
block orthogonal structure. We also provide new construction 
of block orthogonal codes from Cyclic Division Algebras (CDAs) 
and Crossed-Product Algebras (CPAs). In addition, we show how 
the block orthogonal property of the STBCs can be exploited 
to reduce the decoding complexity of a sphere decoder using a 
depth first search approach. Simulation results of the decoding 
complexity show a 30% reduction in the number of floating point 
operations (FLOPS) of BOSTBCs as compared to STBCs without 
the block orthogonal structure. 

I. Introduction & Preliminaries 

Consider a minimal-delay space-time coded Rayleigh quasi- 
static fiat fading MIMO channel with full channel state infor- 
mation at the receiver (CSIR). The input output relation for 
such a system is given by 



Y = HX + N, 
is the channel matrix and N G 



(1) 



where H G C™' 

the additive noise. Both H and N have entries that are i.i.d. 
complex-Gaussian with zero mean and variance 1 and Nq 
respectively. The transmitted codeword is X 6 (^n t xn t an( j 
Y e C"' xn * is the received matrix. The ML decoding metric 
to minimize over all possible values of the codeword X, is 



MfX) 



Y-HX 



(2) 



Definition 1: ffl: A linear STBC C over a real (1- 
dimensional) signal set S, is a finite set of n t x n t matrices, 
where any codeword matrix belonging to the code C is 
obtained from, 



K 



X(Xi,X2,—,Xk) 



(3) 



i=l 



by letting the real variables x\, X2, • • • , xk take values from a 
real signal set S, where Aj are fixed n t x n t complex matrices 
defining the code, known as the weight matrices. The rate of 
this code is complex symbols per channel use. 

We are interested in linear STBCs, since they admit sphere 
decoding (SD) (2) and other QR decomposition based decod- 
ing techniques such as the QRDM decoder which are fast 
ways of decoding for the variables. 

Designing STBCs with low decoding complexity has been 
studied widely in the literature. Orthogonal designs with single 
symbol decodability were proposed in J4), 0, J6). For STBCs 
with more than two transmit antennas, these came at a cost 
of reduced transmission rates. To increase the rate at the 
cost of higher decoding complexity, multi-group decodable 
STBCs were introduced in Q, 0, J9). Another set of low 
decoding complexity codes known as the fast decodable codes 
were studied in iflOl . Fast decodable codes have reduced SD 
complexity owing to the fact that a few of the variables can 
be decoded as single symbols or in groups if we condition 
them with respect to the other variables. Fast decodable codes 
for asymmetric systems using division algebras have been 
reported JTT|. The properties of fast decodable codes and 
multi-group decodable codes were combined and a new class 
of codes called fast group decodable codes were studied in 

m. 

A new code property called the block-orthogonal property 
was studied in [3) which can be exploited by the QR- 
decomposition based decoders to achieve significant decoding 
complexity reduction without performance loss. This property 
was exploited in fPH to reduce to the average ML decoding 
complexity of the Golden code lfT4l and also in ifTBI to reduce 
the worst-case complexity of the Golden code with a small 
performance loss. While the other low decoding complexity 
STBCs use the zero entries in the upper left portion of the 
upper triangular matrix after the QR decomposition, these 
decoders utilize the zeroes in the lower right portion to reduce 
the complexity further. 

The contributions of this paper are as follows: 

• We generalize the set of sufficient conditions for an STBC 
to be block orthogonal provided in J3] for sub-block sizes 
greater than 1. 

• We provide analytical proofs that the codes obtained from 
the sum of Clifford Unitary Weight Designs (CUWDs) 
IfTBI exhibit the block orthogonal property when we 



choose the right ordering and the right number of ma- 
trices. 

> We provide new methods of construction of BOST- 
BCs using Coordinate Interleaved Orthogonal Designs 
(CIODs) El, Cyclic Division Algebras (CDAs) JH and 
Crossed Product Algebras (CPAs) along with the 
analytical proofs of their block orthogonality. 

• We show that the ordering of variables of the STBC used 
for the QR decomposition dictates the block orthogonal 
structure and its parameters. 

> We show how the block orthogonal property of the 
STBCs can be exploited to reduce the decoding complex- 
ity of a sphere decoder which uses a depth first search 
approach. 

• We provide bounds on the maximum possible reduction 
in the Euclidean metrics (EM) calculation during sphere 
decoding of BOSTBCs. 

• Simulation results show that we can reduce the decoding 
complexity of existing STBCs by upto 30% by utilizing 
the block orthogonal property. 

The remaining part of the paper is organized as follows: In 
Section HI] the system model and some known classes of low 
decoding complexity codes are reviewed. In Section [TTT1 we 
derive a set of sufficient conditions for an STBC to be block 
orthogonal and also the effect of ordering of matrices on it. In 
Section HVl we present proofs of block orthogonal structure of 
various existing codes and also discuss some new methods of 
constructions of the same. In Section [V] we discuss a method 
to reduce the number of EM calculations while decoding a 
BOSTBC using a depth first search based sphere decoder and 
also derive bounds for the same. Simulation results for the 
decoding complexity of various BOSTBCs are presented in 
Section |VT] Concluding remarks constitute Section IVHI 

Notations: Throughout the paper, bold lower-case letters 
are used to denote vectors and bold upper-case letters to 
denote matrices. For a complex variable x, denote the real and 
imaginary part of x by xj and xq respectively. The sets of all 
integers, all real and complex numbers are denoted by Z, R 
and C, respectively. The operation of stacking the columns of 
X one below the other is denoted by uec(X). The Kronecker 
product is denoted by <X>, Iy and Or denote the TxT identity 
matrix and the null matrix, respectively. For a complex variable 
x, the (.) operator acting on x is defined as follows 



Xi 
XQ 



-XQ 
XI 



The (.) operator can similarly be applied to any matrix X G 
C nxm by replacing each entry Xij by Xij, i = 1, 2, • • ■ , n, j = 
1, 2, ■ ■ ■ , m, resulting in a matrix denoted by X G jj2nx2m 



Given a complex vector x = [x\,X2, • • ■ ,x n ] , x is defined 

as x = [xn, Xiq, • • • , x nI , x nQ ] T . 



II. System Model 



For any Linear STBC with variables x\, X2---,j£i£j£rven by 
©, the generator matrix G iflOl is defined by vec (X) = Gx, 



where x = [xi,X2-~,xk] ■ In terms of the weight matrices, 
the generator matrix can be written as 

G= vec (Ai) vec (A 2 ) ••• vcc(Ak) 

Hence, for any STBC, (Q]i can be written as 

vec (Y) = H eg x + vec (N), 



where H e „ G 



i2n r n t xK 



is given by H eq 



(I, 



H)G, 



and x = [xi, X2---, xk] , with each Xi drawn from a 1- 
dimensional (PAM) constellation. Using the above equivalent 
system model, the ML decoding metric (j2]i can be written as 



M(x) =|| wec(Y) 



H eg x 



s 11 2 



Using QR decomposition of H eg , we get H eq = QR where 
Q G R 2n r- n t xK is an orthonormal matrix and R G R KxK 
is an upper triangular matrix. Using this, the ML decoding 
metric now changes to 



M(x) =|| Q J uec(Y) Rx 



S l|2. 



Rx 



S II 2 



(4) 



If we have H eq = [hih2...,hjf] , where hj,i G 1,2, ...,K are 
column vectors, then the Q and R matrices have the following 
form obtained by the Gram-Schmidt orthogonalization: 



Q = tai q 2 fbc] , 

where i G 1, 2, K are column vectors, and 



(5) 



R 



ri 










II r 2 || 






(q 2 . h 3> 

II r 3 II 



(qi,h/0 
(q 2 > h /0 
(q 3) h i0 

II r K || 



(6) 



where ri = hi, q x = tAtt and for i = 2, ...K, 



r t =h i -^2(q j ,h l )q j , q. 



A. Low decoding complexity codes 

A short overview of the known low decoding complexity 
codes is given in this section. The codes that will be described 
are multi-group decodable codes, fast decodable codes and fast 
group decodable codes. 

In case of a multi-group decodable STBC, the variables can 
be partitioned into groups such that the ML decoding metric 
is decoupled into submetrics such that only the members of 
the same group need to be decoded jointly. It can be formally 
defined as 0, El, ED: 

Definition 2: An STBC is said to be (/-group decodable 
if there exists a partition of {1, 2, K} into g non-empty 
subsets ri,r 2 ,...,r g such that the following condition is 
satisfied: 

A,A,^ + A ro Af = 0, 
whenever I G Tj and m G Tj and i ^ j. 



If we group all the variables of the same group together in |@), 
then the R matrix for the SD 13, l20l in case of multi-group 
decodable codes will be of the following form: 



R 



Ai 
A 2 








X, 



(7) 



where Aj, i = 1, 2, ...,g is a square upper triangular matrix. 

Now, consider the standard SD of an STBC. Suppose the R 
matrix as defined in © turns out to be such that when we fix 
values for a set of symbols, the rest of the symbols become 
group decodable, then the code is said to be fast decodable. 
Formally, it is defined as follows: 

Definition 3: An STBC is said to be fast SD if there exists 
a partition of {1,2, L} where L < K into g non-empty 
subsets ri,r2,...,r g such that the following condition is 
satisfied for all i < j 



(8) 



whenever i € T p and j <G T q and p ^ q where q t and hj are 
obtained from the QR decomposition of the equivalent channel 
matrix H eq = [hih 2 ..., h^] = QR with hj,i G 1,2,..., if as 
column vectors and Q = [q± q 2 ... q K ] with q i , i e 1,2, K 
as column vectors as defined in 10. 

Hence, by conditioning K — L variables, the code becomes 
g-group decodable. As a special case, when no conditioning 
is needed, i.e., L = K, then the code is g-group decodable. 
The R matrix for fast decodable codes will have the following 
form: 

" A Bi 
B 2 



R 



(9) 



where A is an L x L block diagonal, upper triangular matrix, 
B 2 is a square upper triangular matrix and Bi is a rectangular 
matrix. 

Fast group decodable codes were introduced in IfTZl . These 
codes combine the properties of multi-group decodable codes 
and the fast decodable codes. These codes allow each of the 
groups in the multi-group decodable codes to be fast decoded. 
The R matrix for a fast group decodable code will have the 
following form: 



R 



Ri 
R 2 








R„ 



(10) 



where each Rj,i = 1,2, ...,<? will have the following form: 



R 



A, B, 
B, 



(ID 



where Aj is an Li x Li block diagonal, upper triangular matrix, 
Bi 2 is a square upper triangular matrix and B^ is a rectangular 
matrix. 



III. Block Orthogonal STBCs 

Block orthogonal codes introduced in O are a sub-class 
of fast decodable / fast group decodable codes. They impose 
an additional structure on the variables conditioned in these 
codes. An STBC is said to be block orthogonal if the R matrix 
of the code has the following structure: 



R 



Ri B 12 
R 2 





Bir 
B 2 r 

Rr 



(12) 



where each R,, i = 1,2, ...,r is a block diagonal, upper 
triangular matrix with k blocks Uii,Ui 2 , ...,Usfc, each of size 
7x7 and By, i = 1,2, T, j = i + 1, T are non-zero 
matrices. 

The low decoding complexity codes described in Section 
HIl utilize the zero entries in the upper triangular matrix R, 
in the breadth first or depth first search decoders such as the 
sphere decoder or the QRDM decoder to achieve decoding 
complexity reduction. The fast sphere decoding complexity 
II2TI of an STBC is governed by the zeros in the upper left 
block of the R matrix and does not exploit the zeros in the 
lower right blocks. The zeros in the lower right block can be 
used to reduce the average decoding complexity of the code 
where the average decoding complexity refers to the average 
number of floating operations performed by the decoder. The 
zeros in the lower right block are also utilized in some non 
ML decoders such as the QRDM decoder J3j or the modified 
sphere decoder lfT31 to reduce the decoding complexity of the 
code. 

A. Design criteria for Block Orthogonal STBCs 

The structure of block orthogonal matrix was defined in 
([T2I 1. In general, the size of block diagonal matrices, R^'s, and 
the upper triangular blocks in these matrices can be arbitrary. 
Similar to [5], we consider only the case that R;S have the 
same size, k x k, and the upper triangular blocks in R^s each 
have the same size 7x7. Hence, a block orthogonal code can 
be represented by the parameters (F, k, 7): 

* V: The number of matrices R; in R; 

> k: The number of blocks in the block diagonal matrix R^ 
- denoted by Uy , 1 < j < k\ 

> 7: The number of diagonal entries in the matrices Uy. 
A set of sufficient conditions for an STBC to be a BOSTBC 

with the parameters (T, k, 1) are described below: 

1) 2-Block BOSTBC: First a condition for the STBC to be 

block orthogonal with parameters (2, k, 1) is given. The case 

for r > 2 will be given subsequently. 

Lemma 1: Q Consider an STBC of size T x N t with 

weight matrices Ai,...,Afe , Bi,...,Bfe. Let 



and Ai = 



-A 



B 



B 



B B 



iup\ 2 Tx2N t 



Bi — [&mp] 



tupl 2 Tx2N t 



l,...,k, 



1, ...2T and p = 1, ...2N t . This STBC has block orthogonal 
structure (2, k, 1) if the following conditions are satisfied: 



{Ai, ...,Ak,Bi,Bk] is of dimension 2k. 
AjAi = I and BjBi = I for % = 1, k. 



A i Aj — - 
and i ^ j. 



-AjAi and BfBj 



-Bj Bi for i,j = 1, k 



T,(p, q ^s,t)es d pqst = for i,j = l,...,k and i ^ j where 

2T 

dpqst — 




v=l J 



and each element (tuple) of § includes four uniquely 
permuted scalars drawn from {1, 2Nt}. 

2) Y -block BOSTBC, T > 2: The set of conditions for an 
STBC to have a block orthogonal structure with parameters 
(r, k, 1) is now given. 

Lemma 2: O Let the R matrix of an STBC with weight 
matrices {A l7 ...,A L } , {B 1; B fc } be 



R 



Ri E 
R 2 



where Ri is a L x L block-orthogonal matrix with parameters 
(r — l,fc,l), E is an L x k matrix and R 2 is a k x k 
upper triangular matrix. The STBC will be a block orthogonal 
STBC with parameters (T, k, 1) if the following conditions are 
satisfied: 

• The matrices {Bx, Bj.} are Hurwitz-Radon orthogonal. 

• The matrix E is para-unitary, i.e., E ff E = I. 

The authors in J5] only discuss the conditions for the block 
orthogonal codes with parameters (T,k, 1). These conditions 
can be easily derived for BOSTBCs with parameters (T, k, 7) 
as well. We first derive the conditions for T = 2. 

Lemma 3: Consider an STBC of size nt x T with weight 
matrices {Ai, A2, A;}, {Bi, B 2 , B;}. Let the R matrix 
for this STBC be of the form 



R 



R : E 
R 2 



where Ri and R 2 are I x I upper triangular matrices, E is an I x I 
matrix. The STBC will have a block orthogonal structure with 
parameters (2, k,j) if the following conditions are satisfied: 

• The matrices {Ai,...,A;} are fc-group decodable with 
7 variables in each group, i.e., {Ai,...,A;} can be 
partitioned into k sets {Si, ...,Sfc}, each of cardinality 7 



H 



A,-A, = for all A, e S m , A, e S, 



such that AiAf 

m =/= n. 

The matrices {Bi,...,Bj} are /c-group decodable with 7 
variables in each group, i.e., {Bi,...,B;} can be parti- 
tioned into k sets {Si, ...,Sfc}, each of cardinality 7 such 
that B. ( Bf + BjBf = for all B t e S m , B^- e S„, 

??i 7^ n. 

The set of matrices {Ai, ...,Aj,Bi, ...,Bj} are linearly 
independent over R. 

The matrix E H E is a block diagonal matrix with k blocks 
of size 7x7. 

Proof: Proof is given in Appendix [A] ■ 



R 



Lemma 4: Let the R matrix of an STBC with weight 
matrices {Ai,...,Al} , {Bi, B;} be 

Ri E 
R 2 

where Ri is a L x L block-orthogonal matrix with parameters 
(r — 1, k, 7), E is an L x I matrix and R 2 is a / x I upper 
triangular matrix. The STBC will be a block orthogonal 
STBC with parameters (T, k, 7) if the following conditions 
are satisfied: 

• The matrices {Bi,...,B;} are fc-group decodable with 7 
variables in each group, i.e., {Bi,...,B;} can be parti- 
tioned into k sets {Si, S^}, each of cardinality 7 such 
that B;Bf + Bj-Bf = for all B^ e S m , B^ g S„, 

m 7^ n. 

m The set of matrices {Ai, Aj,,Bi, ...,B;} are linearly 
independent over R. 

• The matrix E H E is a block diagonal matrix with k blocks 
of size 7x7. 

Proof: Proof is given in Appendix [B] ■ 
B. Effect of ordering on block orthogonality 

We now show that the block orthogonality property depends 
on the ordering of the weight matrices or equivalently the 
ordering of the variables. If we do not choose the right 
ordering, we will be unable to get the desired structure. 

Example 1: Let us consider the Golden code fl4l given by: 



X = 



(13) 



1 a (si + s 2 9) ja (s 3 + S4#) 
y/5 [ a (s 3 + s 4 6>) a (si + s 2 0) 

where 6 = (l + VE) /2, 6 = (l - VB) A a = 1 +3 (1 - 6), 
a = 1 +j (l - 0) and s, = s tI + js iQ for i = 1, 4. 

If we order the variables (and hence the weight matrices) 
as [su, Sxq, s 2 i, s 2 q, S37, s 3Q , S47, S4 Q ], then the R matrix for 
SD has the following structure 



R 



t t t t 



t t t t t 



t 



t 







t t 
t 



t 



t 
t 
t 

t t t 

t 
t t 
t 







where t denotes non zero entries. This ordering of vari- 
ables has presented a (4, 2, 1) block orthogonal structure 
to the R matrix. Now, if we change the ordering to 
[su,S2i,SiQ,s 2 Q,S3i,s i i, S3q,s 4 q], then the R matrix for 
SD has the following structure 

ttOOtttt 



R 



t 

t t 

t 



t 

t t 

t 



t t t t 

t t t t 

t t t t 

t t 



where t denotes non zero entries. This ordering of variables 
has presented a (2, 2, 2) block orthogonal structure to the R 
matrix. We can also have an ordering which can leave the 
R matrix bereft of any block orthogonal structure such as 
[su, siq,s 4 i,s 2 q,s 3 i,ssq,S2i,S4q]. The structure of the R 
matrix in this case will be 

t t t t 



R 



t t 
t t t t 
t t t 



t 
t 

t t 



t t t 

ooootttt 
ooooottt 

t t 
t 



Also note that we have many entries m ^ even when the 
i-th and the j-th weight matrices are HR orthogonal such as 
for cases i = 6,j = 8 and i = 5,j = 8 etc. 

IV. Construction of Block Orthogonal STBCs 

Code constructions for block orthogonal STBCs with vari- 
ous parameters were presented in O. It was shown via simu- 
lations that these constructions were indeed block orthogonal 
with the aforementioned parameters. We provide analytical 
proofs for the block orthogonal structure of some of these 
constructions which include also other well known codes such 
as the BHV code Qol, the Silver code J22) and the Srinath- 
Rajan code l23l . We first study some basics of CUWDs and 
CIODs. 

A. CUWDs and CIODs 

1) CUWDs: QH Linear STBCs can be broadly classified 
as unitary weight designs (UWDs) and non unitary weight 
designs (NUWDs). A UWD is one for which all the weight 
matrices are unitary and NUWDs are defined as those which 
are not UWDs. Clifford unitary weight designs (CUWDs) are 
a proper subclass of UWDs whose weight matrices satisfy 
certain sufficient conditions for g-group ML decodability. To 
state those sufficient conditions, let us list down the weight 
matrices of a CUWD in the form of an array as shown in 
Table U 

TABLE I 
Structure of CUWDs 



Ai 
A 2 



Aa+i 

Aa+2 



l (9-l)A + l 
k -(9-l)A + 2 



All the weight matrices in one column belong to one group. 
The weight matrices of CUWDs satisfy the following sufficient 
conditions for (/-group ML decodability. 

. Ai = I. 

* All the matrices in the first row except Ai should 
square to I and should pair-wise anti-commute among 
themselves. 



• The unitary matrix in the i-th row and the j-th column 

is equal to AjA( 3 -;n_|_i. 
The CUWD matrix representation for these matrices for a 
system with 2° transmit antennas are given below (9). Let 



1 " 




' 


3 




' 1 


-1 




. 3 





, cr 3 = 


-1 



The representations of the Clifford generators are given by: 

R{ lx )=±jaf\ 



R (72fc) = 
■R(72fc+l) 



T 

A 



where k = 1, ...,a. The weight matrices of the CUWD for a 
rate-1, four group decodable STBC can be derived as follows. 
Let a t = jR (724) R (721+1) for i = 1, 2, a - 1. Let A = 
2 a ~ 1 . The weight matrices are now given by 

A x +i =R(l), 

A 2 A+1 = R(j2a+l) , 
A 3A+ l = R(j2a) , 



A jA+fe = A*A 
A fe = I 



fcAjA+1, 
a-1 

ki 

a/ 



(14) 



for j = 1,2,3, k = 1, ..A and where (ki, k 2 , k a ^i) is the 
binary representation of k — 1 . 

2) CIODs: Coordinate interleaved orthogonal designs 
(CIODs) were introduced in ifTTll . 

Definition 4: A CIOD for a system with 2 a transmit anten- 



nas in variables Xi, i = 1, K - 
matrix S (xq, ...,xk-i), such that 



S 



61 (x , 







G 2 



1, K even, is a 2 a x 2 a 



(15) 



2 Ik, ...,X K -1 



where 81 [xo, xk__ 1 j and 82 [xk. , xk-iJ are com- 
plex orthogonal designs of size 2 a ~ 1 x 2 a_1 and xi = 

%il + 3 x (i+K/2)modK- 

B. BOSTBCsfrom CUWDs 

We now show that STBCs obtained as a sum of rate-1, 
four group decodable CUWDs exhibit the block orthogonal 
structure with parameters (2,4, A). 

Lemma 5: Construction I: Let Xi (s\, s 2 , S4\) be a 
rate-1, four group decodable STBC obtained from CUWD 
|[T6l with weight matrices {Ai, A2, A-4\}- Let M be 
an n t x n t matrix such that the set of matrices 
{Ai, A2, ...,A4a,MAi,MA2, ...,MA4a} are linearly indepen- 
dent over R. Then the STBC given by 

X (si, s 2 , s 8A ) = Xi (si, s 2 , s 4A ) 

+ M.Xi («4A+1, S4 A+ 2, — , Ss\) , 



will exhibit a block orthogonal structure with parameters 
(2, 4, A). 

Proof: Proof is given in Appendix [C] ■ 
Example 2: Let us consider the BHV code given by: 



and M as 



where Xi 

Xi (si,s 2 ) 



X = Xi (si, s 2 ) + TXi (zi,z 2 ) , 

and Xi take the Alamouti structure, 

" *i -*a 1 T [ 1 

s 2 s{ ' -1 



and 
and 



[21,22] = U[s3,S4] , where U is a unitary matrix chosen 
to maximize the minimum determinant. In this case, as per 
the above construction, M = TU. Hence, the BHV code is a 
BOSTBC with parameters (2,4, 1). 

C. BOSTBCsfrom Cyclic Division / Crossed Product Algebras 

In this section, we show the block orthogonality property 
of two constructions from either cyclic division algebras or 
crossed product algebras over the field Q (i). 

Lemma 6: Construction II: Let X be an STBC with weight 
matrices {Ai,...,Art} and {Bi,...,B^} for the variables 
[xu ... xki] and [x\q ... xkq] respectively such that = 
jAi for 1 < i < K. Then the code X exhibits the block 
orthogonal property with parameters (K, 2,1) if we take the 
ordering of weight matrices as {Ai,Bi, ...,Ak,Bk}. 

Proof: Proof is given in Appendix [D] ■ 

Example 3: Consider any STBC obtained from the Cyclic 
Division Algebra (CDA) ITU over the base field <Q> (i). The 
structure of such an STBC will be 

X 7Cr(x„_i) ••• 
xi a (x ) 



X 



70— (xi) 
7a"- 1 (x„_ a ) 



-1 cr(x„_ 2 ) 



a"- 1 (x Q ) 



where Xk = Xki + jxkQ- The weight matrices of this STBC 
satisfy the properties of the construction above. Hence, this is 
a BOSTBC with parameters (n, 2, 1). 

The next construction is a special case of the previous 
construction. 

Lemma 7: Construction III: Let Xi be a two group 
decodable STBC with weight matrices {Ai, Ak} and 
{Bi, ...,Bk} for the variables [x\ ... xk] and [x_r-+i •■• %2k] 
respectively such that B^ = jAi for i = 1, ...if. 
Let M be a matrix such that the matrices in the set 
{Ai, A2, Ak, MAi, MA2, ...,MAk} are linearly indepen- 
dent over R. Then the STBC given by 

X (xi,X 2 , —,X4,k) = Xi (xx,X 2 , -,X2K) 

+ M.Xi (x 2K +l,X 2K + 2 , Xak) , 

will exhibit a block orthogonal structure with parameters 
(2,2, JQ. 

Proof: Proof is given in Appendix [E] ■ 
Example 4: Consider the golden code as given in example 
[U If we consider, 

J_\ a (si + s 2 9) _0 
y/5[ a( Sl + s 2 9) 



Xi 



M 



3 

1 



we can see that the golden code is a BOSTBC with parameters 

(2,2,2). 

D. BOSTBCsfrom CIODs 

In this section we show that the BOSTBCs that can be 
obtained from CIODs ifTTl . 

Lemma 8: Construction IV: Let Xi (si, s 2 , Sk ) be 
a rate-1 CIOD with weight matrices {Ai, A2, Ak}. 
Let M be a matrix such that the set of matrices 
{Ai, A2, Ak ,MAi,MA2, ...,MA^} are linearly indepen- 
dent over R. Then the STBC given by 

X (si, s 2 , s 2K ) = Xi (si, s 2 , sk) 

+ MXi (s k +i,sk+ 2 , — , s 2K ) , 

will exhibit a block orthogonal structure with parameters 

(2, K/2, 2). 

Proof: Proof is given in Appendix [F| ■ 
Example 5: Consider the 2x2 code constructed by Srinath 
et al. in l23l given by 



X = 
If we consider. 

Xj 

and M as 



xu +jx 2Q 
e^/ 4 (x 4I +jx 3Q ) 



W4 (x 3 i + jx 4Q ) 

X 2 I + JXiQ 



XlI+jX 2 Q 

x 2I + jxiQ 



M 



e J7r / 4 
e W4 



we see that the code is a BOSTBC with parameters (2, 2, 2). 

V. Reduction of Decoding complexity for Block 
Orthogonal Codes 

In this section we describe how we can achieve decoding 
complexity reduction for BOSTBCs. Also we show how 
the block orthogonal structure helps in the reduction of the 
Euclidean Metric (EM) calculations and the sorting operations 
for a sphere decoder using a depth first search algorithm. We 
also briefly present the implications of the block orthogonal 
structure for QRDM decoders as discussed in O. 

A. ML decoding complexity reduction 

The sphere decoder under consideration in this section will 
be the depth first search algorithm based decoder with Schnorr- 
Euchner enumeration and pruning as discussed in |;13j. We first 
consider the case of T = 2 Block Orthogonal Code. 
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Fig. 1. First two levels of the sphere decoder tree for the code in example 
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1) r = 2: Consider a BOSTBC with parameters (2, £,7). 
The structure of the R matrix for this code is as mentioned 
in dTZb with two blocks Ri and R2. This code is fast sphere 
decodable, i.e., for a given set of values of variables in sub- 
blocks U2.J, j = l,...,k, we can decode the variables in 
Uij and Ui j, 1 < j < I < k, independently. The ML 
decoding complexity of this code will be O (Af fc7+7 ). Due 
to the structure of the block orthogonal code, we can see that 
the variables in the blocks U2.J and U2,j, 1 < j < I < k, are 
also independent in the sense that the EM calculations and the 
Schnorr-Euchner enumeration based sorting operations for the 
variables in U2 j are independent of the values taken by the 
variables in U2,z- We illustrate this point with an example. 

Example 6: Consider a hypothetical BOSTBC having the 
parameters (2,2,1) with variables {xi,X2,Xs,x^}. The R 
matrix for this BOSTBC will be of the form 



R 



The first two levels of the search tree for the sphere decoder 
are shown in in Figure Q] with the variables assumed to be 
taking values from a 2-PAM constellation - A. As it can be 
seen from the figure, irrespective of the value taken by X4, the 
edge weights (Euclidean metrics) for the variable x% remain 
the same. 

From example [6] we can see that instead of calculating the 
EM repeatedly, we can store these values in a look up table 
when they are calculated for the first time and retrieve them 
whenever needed. This technique of avoiding repeated calcu- 
lations by storing the previously calculated values is known 
as Memoization 11241 . This approach reduces the number of 
floating point operations (FLOPS) significantly. 

2) T > 2: Consider a BOSTBC with parameters (r, £,7). 
The structure of the R matrix for this code is as mentioned 
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in (TT2l) . Consider the block R;, 1 < i < T of the R matrix. 
For a given set of values for the variables in the blocks R m , 
m > i, we can see that the variables in the blocks U;j and 
Ui j, 1 < j < I < k, are independent as seen in the case 
of r = 2. Hence, we can use memoization here as well in 
order to reduce the number of EM calculations and sorting 
operations. 

B. Complexity reduction bound and Memory requirements for 
depth first sphere decoder 

We calculate the maximum possible reduction in the number 
of EM values calculated and the memory requirements for the 
look up tables in this section. First we consider the case of 

r = 2. 

1) r = 2: Considering a (2, A;, 7) BOSTBC, we first 
calculate the memory requirements for storing the EM values. 
Let each of the variables of the STBC take values from a 
constellation of size M. The number of EM values that need 
to be stored for a single sub-block U2J, 1 < j < k, is 

Mem (U 2j ) = M + M 2 + ... + A/ 7 

_ M 7+1 — M _ M (A/ 7 - 1) 
~ M - 1 ~ M- 1 ' 

These values will need to be stored for (k — 1) such sub- 
blocks. The total memory requirement for the block R2 is, 

,„ ■< x M(NP - 1) 

Mem R 2 = (k - 1 -. 

v 2 ' v ; M -1 

We now find the maximum number of reductions possible 
for the EM calculations for this BOSTBC. This will occur 
when all the nodes are visited in the depth first search. For the 
block R 2 , the number of EM calculations for a code without 
the block orthogonal structure would be 

M (M fe7 



Ostbc =M + M' 



M k ~< = 



1 



M - 1 

For a BOSTBC, if we use the look up table, we would be 
performing the EM calculations only once per each of the 
sub-block. For k sub-blocks, the number of EM calculations 
will be 



O 



BOSTBC 



k (M + M 2 + 

M (M 7 - 1) 



Af 7 ) 



k 



M - 1 

We therefore perform only a small percentage of EM calcula- 
tions if the code exhibits a block orthogonal structure. We call 
the ratio of the the number of EM calculated for a BOSTBC 
to the number of EM calculated if the STBC did not possess 
a block orthogonal structure as Euclidean Metric Reduction 
Ratio (EMRR) given by 

k M(AP-l) 



o 



BOSTBC 

Ostbc 



M-l 



M(M k ~<-l) 
k 



k (Af 7 - 1) 
(M k '< - 1) 



which is a decreasing function of k, M and 7. 



2) T > 2: Considering a (T,k,~f) BOSTBC, we first 
calculate the memory requirements for storing the EM values. 
The memory requirement per sub-block Uj 3 -, 1 < j < k, of 
any block R,, 1 < j < T, under consideration is the same as 
that of the case of the sub-block U2J in the r = 2 case. This 
is so because, for a given set of values for the variables in 
the blocks R m , i < m < T, the memory requirement for the 
sub-block Ujj can be calculated in the similar way as it was 
calculated for \J2.j for the T = 2 case. Hence, the memory 
requirements for a block R^ for a given set of values for the 
variables in the blocks R„, is the same as that of R 2 in the 
r = 2 case. 



M em (R. 



1 ) conditional 



(fc-i) 



M(M 7 - 1) 
M - 1 ' 



We can reuse the same memory for another set of given values 
of the variables of R TO , as the previous EM values will not 
be retrieved again as the depth first search algorithm does not 
revisit any of the previously visited nodes (i.e., any previously 
given set of values for the variables in the tree). Hence, we 
can write, 



Mem(R t ) = (k - 1) 



M (M 7 - 1) 
M — 1 : 



for 1 < i < r. Since there are T — 1 such blocks, the total 
memory requirement for storing the EM values will be 



Mem (R) = (T - 1) (k - 1 



M (M 7 - 1) 
M-l ' 



We now find the maximum number of reductions possible 
for the EM calculations for this BOSTBC. This will occur 
when all the nodes are visited in the depth first search. For 
blocks other than Ri, the number of EM calculations for a 
code without the block orthogonal structure would be 

Ostbc = M + M 2 + ... + M {T -^ k ~i 
M (M^- 1 ^ - 1) 
~ M-l ' 

For a BOSTBC, if we consider the block R^ and for a given 
set of values for the variables in R m , i < m < V, if we use the 
look up table, we would be performing the EM calculations 
only once per each of the sub-block. For k sub-blocks, the 
number of EM calculations will be 

ObOSTBC {^conditional = k {M + M 2 + ... + M 7 ) 

M (M 7 - 1) 



k- 



M - 1 



These calculations need to be repeated for all the M^ r ^ fc7 
values of the variables in R m . 

Obostbc (Ri) - kM < r - 4 > fe7 (M + M 2 + ... + M 7 ) 

M - 1 



The EM calculations for all the blocks is given by 
r 

Obostbc 



VfcM (r-^M(M 7 -l) 
M- 1 



i=2 



fcM(M 7 - 1) M^ -1 ^ 7 - 1 



M- 1 

The EMRR in this case will be 



M k i - 1 



kM(M'<~l) j\/ (r ~ 1)fc7 -l 

Obostbc m-x w-i-i 



O 



STBC 



M(M( r ~ 1 )''T-l) 
M-l 

fc(M 7 -l) k 



(M fe7 -1) M^" 1 ) 7 ' 

We can see that the ratio of the reduction of operations is 
independent of T and dependent only on k and 7. 

C. QRDM decoding complexity reduction 

In this section we review the simplified QRDM decoding 
method which exploits the block orthogonal structure of a 
code as presented in Q. The traditional QRDM decoder is 
a breadth first search decoder in which M c surviving paths 
with the smallest Euclidean metrics are picked at each stage 
and the rest of the paths are discarded. If M c = M ( r_1 ) fc7 
for a block orthogonal code with parameters (r, k, 7), then 
the QRDM decoder gives ML performance. The simplified 
QRDM decoder utilizes the block orthogonal structure of the 
code to find virtual paths between nodes, which reduces the 
number of surviving paths to effectively M Ce , to reduce the 
number of Euclidean metric calculations. For details of how 
this is achieved, refer to 0. The maximum reduction in 
decoding complexity bound for a QRDM decoder is given 
by 

Obostbc _ 
Ostbc ~fc(M 7 -l)' 

VI. Simulation Results and Discussion 

In all the simulation scenarios in this section, we consider 
quasi-static Rayleigh flat fading channels and the channel state 
information (CSI) is known at the receiver perfectly. Any 
STBC which does not have a block orthogonal property is 
assumed to be a fast decodable STBC which is conditionally k 
group decodable with 7 symbols per group, but not possessing 
the block diagonal structure for the blocks R2 , • • ■ , Rr ■ 

A. Sphere decoding using depth first search 

We first plot the EMRR for BOSTBCs with different 
parameters against the SNR. Figures |2] and [3] show the plot 
of Obostbc /Ostbc vs SNR for a (2,4, 1) BOSTBC (ex- 
amples - Silver code, BHV code) with the symbols being 
drawn from 4-QAM, 16-QAM and 64-QAM. We can clearly 
see that the reduction in the EMRR with the increasing size 
of signal constellation as explained in section IV-BI It can 
also be seen that a larger value of k gives a lower EMRR 
if we keep the product kj constant. Figure H] shows the 
plot of Obostbc /Ostbc vs SNR for a (2,4,2) BOSTBC 
(examples -4x2 code from Pavan et al l23l ) with the 
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Fig. 2. EMRR vs SNR for a BOSTBC with parameters (2, 4, 1) 




5 10 15 20 25 30 



SNR (dB) 

Fig. 3. EMRR vs SNR for a BOSTBC with parameters (2, 2, 2) 



symbols being drawn from 4-QAM and 16-QAM. Notice that 
the (2, 4, 2) BOSTBC offers a lower EMRR as compared to the 
(2,4, 1) BOSTBC due to the higher value of 7, as explained 
in section [V-Bl 

We now compare the total number of FLOPS performed 
by the sphere decoder for a BOSTBC against that of an 
STBC without a block orthogonal structure for various SNRs. 
Figures |U |6] show the plot of number of FLOPS vs SNR 
for a (2,4,1) BOSTBC, a (2,2,2) BOSTBC and a (2,4,2) 
BOSTBC respectively with the symbols being drawn from 4- 
QAM, 16-QAM and 64-QAM for the first two figures and 
from 4-QAM and 16-QAM for the last one. We can see that 
the BOSTBCs offer around 30% reduction in the number of 
FLOPS for the (2, 4, 1) and (2, 4, 2) BOSTBCs and around 
15% for the (2, 2, 2) BOSTBC at low SNRs. 
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Fig. 4. EMRR vs SNR for a BOSTBC with parameters (2, 4, 2) 
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Fig. 5. The number of FLOPS required for decoding for a BOSTBC with 
parameters (2, 4, 1) 



B. Comparison with the QRDM decoder approach 

The primary difference between the depth first and the 
breadth first (QRDM) approach is the variation of the EMRR 
with respect to SNR. As seen in the figures from section [VI-AI 
the effect of the block orthogonal property reduces as the SNR 
increases in the depth first sphere decoder. This is owing to 
the Schnorr-Euchner enumeration and pruning of branches. As 
the SNR increases, the decoder needs to visit fewer number of 
nodes in order to find the ML solution and hence the EMRR 
also tends to 1 . However, in the case of a breadth first search 
algorithm, all the nodes need to be visited in order to arrive 
to a solution. Hence the EMRR is independent of the SNR in 
the breadth first search case. To reduce the number of nodes 
visited, only M c paths are selected in the QRDM algorithm 
to reduce complexity. The value of M c chosen needs to be 
varied with SNR in order to get near ML performance. 



# FLOPS for 4-QAM without BO structure 

# FLOPS for 4-QAM with BO structure 

# FLOPS for 1 6-QAM without BO structure 

# FLOPS for 1 6-QAM with BO structure 

# FLOPS for 64-QAM without BO structure 

# FLOPS for 64-QAM with BO structure 




Fig. 6. The number of FLOPS required for decoding for a BOSTBC with 
parameters (2,2,2) 
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Fig. 7. The number of FLOPS required for decoding for a BOSTBC with 
parameters (2,4,2) 



VII. Conclusion 

In this paper we have studied the block orthogonal property 
of STBCs. We have shown that this property depends upon the 
ordering of weight matrices. We have also provided proofs of 
various existing codes exhibiting the block orthogonal prop- 
erty. A method of exploiting the block orthogonal structure of 
the STBCs to reduce the sphere decoding complexity was also 
given with bounds on the maximum possible reduction. 
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Appendix A 
Proof of Lemma[3] 

Following the system model in Section [II] we have the 



equivalent channel matrix H 
[Hi H 2 ] = [hi ... h, h i+ i 



eq ^ 

h 2 i] 



H, 



H^H 2 



are 
Under 



We know from Theorem 
2 of |23l that, if any two weight matrices A, and Aj are 
Hurwitz-Radon orthogonal, then the i-th and the j-th columns 
of the H eq matrix are orthogonal. Due to the conditions on 
the weight matrices, we have that H^Hi and 
block diagonal with k blocks, each of size 7 x 
QR decomposition, H eg = QR with Q = [Q x Q 2 ] with 

Ri E 
R 2 j 

can be seen from Lemma 2 of llzTI that the matrix Ri is block 
diagonal with k blocks, each of size 7x7. We can now write, 

H 2 =QiE + Q 2 R 2 , 



Q1Q2 G 



v2n r nt xl 



and R 



as mentioned. It 



Appendix C 
Structure of the R matrix obtained from 
Construction I 

According to construction I, the structure of the STBC is 

X = Xi (si, s 2 , s^) + MX 2 (s 4A+ i, S4A+2, — , s 8 \) , 

where Xi is a rate-1 four group decodable STBCs obtained 
from CUWDs as described in Section IIV-A1I 

Let the R matrix for this code have the following structure: 



R 



Ri E 
R 2 



where Ri, E and R 2 are 4A x 4A matrices. 



(Ha 

Simplifying, 



QiE) T (H a 



QiE) 



R^Q^Q 2 R 2 



H^H 2 



E T E 



R 2 P R 2 



A. Structure of Ri 

From I2TI . it can be easily seen that Yi has a block diagonal 
structure with four blocks, and each block of the size A x A. 



Now, if E T E is block diagonal with k blocks of size 7x7 
each => R^R 2 is block diagonal with k blocks of size 7x7 
each. Since R 2 is upper triangular, this means that R 2 is block 
diagonal with k blocks of size 7x7 each. 



Ri 



Rn 
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Appendix B 
Proof of Lemma@] 

Following the system model in Section [II] we have the 



equivalent channel matrix H eq G 



p2n r n t xL-\-l 



H 



hi ... h£ h^ +1 ... h/, + ;]. We know from 
that, if any two weight matrices and 



[Hj H 2 ] = 
Theorem 2 of 

Rj are Hurwitz-Radon orthogonal, then the i-th and the j- 
th columns of the H er? matrix are orthogonal. Due to the 
conditions on the weight matrices, we have that H 2 r H 2 is 
block diagonal with k blocks, each of size 7x7. Under 



QR decomposition, 

a2n r nt X L 



H eq = QR with Q = 

Ql S ]&^rn t XL an( j Q 2 g R 2n r n t xl an( j R 

as mentioned. We can now write, 



Qi Q 2 ] with 
Ri E 
R 2 



where Ri^, i = 1, ...4 is a A x A given by dl6l) , 

Proposition 1: The non-zero blocks of the matrix Ri are 
equal i.e., Rn = Rij, for i = 2,3,4. 

Proof: It is sufficient for us to prove that 



r i II -II r 4(i-l)A+j 



and 



(q 3 ,h fc ) = (q 4 ( 4 -i)A+j 1 h 4(i-i 



)A+fe / j 



(17) 



(18) 



for i = 2,3,4, j = 1,..., A - 1 and k=j + 1, A. 

The proof is by induction. We first consider the case of 
j = 1. We also recall ll23l that 



H 2 =QiE + Q 2 R 2 , 
(H 2 - QiE) T (H 2 - Q]E) = R 2 r Q 2 P Q 2 R 2 



Simplifying, 



H 2 P H 2 



E T E 



R 2 r R 2 



Now, if E E is block diagonal with k blocks of size 7x7 



each 



R 2 R 2 is block diagonal with k blocks of size 7x7 



each. Since R 2 is upper triangular, this means that R 2 is block 
diagonal with k blocks of size 7x7 each. 



Now, for ( [T7| ) we have, 
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r 4(i- 



1)A+1 II (! t Ll(i-l)A+lil 1 4(i~l)A+2 

J II r 4 (i-i)A+2 II 
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v Q4(i-l)A+2' h 4(j-l)A+A / 

II r 4(i-l)A+A I 



(16) 



since Ta(4- 



4(i-1)A+1 



1, ...,4A. For (fl~8T > we have, 
1 



1 4(i-l)A+l 



and AfcA fc = I for k = 



(hi, life) 
tr ( HAiATh T 



2 II r x || 

tr (HA 4(i _ 1)A+1 A 4(l _ 1)A+1 A fe H 



2 || r4(«-i)A+i I 

/ ~ ~ ~ T ~1 

tr ^HA 4 ( i _ 1 ) A+1 A 4(i _ 1)A+fe H 
2 || r 4(l _i) A+ i || 

q4(i-l)A+lih4(i-l)A+fc) , 



since AfcA 4 ( i _ 1 ) A+1 = A 4 ( i _ 1 - )A+fc . Now we prove equations 
( TTtT > and < TT~8T > for arbitrary j. We prove this by induction. Let 
the equations hold true for all I < j. We now have for equation 



I 3-1 j-1 

= ( h J - Z h ^ q ' ' h J ~ ( q fe' h j) q * 

\ i=i fc=i 

j-i 

= (h j ,h i )-2^(q ; ,h i ) 2 

j'-i i-i 
&=i i=i 

= -*r (hA.-aJh 7 ) -2^(q i; h,) 2 

i-i i-i 

fe=i j=i 

1/-- - T ~ T\ 

= 2 tr ( HA 4(j-i)A+jA 4(l _ 1)A+j H 

~ 2 Z \l4(j-l)A+;' h 4(i-l)A+j 

;=i 

3-1 3-1 



4(i-l)A+j 

fc=l i=l 

l4(i-l)A+fci ^4(i-l)A+j ) ( l4(i-l)A+ii l4(i-l)A+fc 



which follows from the induction hypothesis and the fact that 
AjAj = I for j = 1, 4A . For equation (1 1 St . 



1 / i_1 

( q 3' hfe ) = ( ^ ~ Z (^' h j) 9j ' hfc 



3-1 



z=i 



2 II r. 



3-1 



tr (haXh t ) -E(q^ h 3-)(q ;! hA 



/=1 



- T - T\ 

tr (HA 3 A 4(j _ 1)A+1 A; ( , t _ 1)A+1 A fe H J 
2 || r 4(i _ 1)A+J |j 

Z (q4(i-l)A+i) h 4(i-l)A+j 



2 || r 4(i _i )A+J n ;=i 

( t l4(i-l)A+iJ ,1 4(i-l)A+fe^ 



r T 



tr (HA 4(l _ 1)A+: ,A 4(i _ 1)A+fc H 



2 || r 4 (j-i)A+j 
1 



2 II r 4(i-l)\+j || |j \ Q4( 
(q4(i-l)A+i' h 4('i-l)A+fc^ 
(<l4(i-l)A+3> h 4(i-:L)A+fc^ • 



i-l)A+/' n 4(i-l)A+i 



5. Structure of E 

The matrix E is key for the block orthogonality property 
of the STBC in question. It is required to be para-unitary 
for achieving this property. The structure of the matrix E for 
Construction I is described in the following proposition. 

Proposition 2: The matrix E is of the form 



E 



Ei 


— E2 


-E 3 


— E4 


E-2 


Ei 


-E 4 P 


E 3 P 


E 3 


E 4 P 


Ei 


E 2 P 


E4 


-E 3 P 


E 2 P 


Ei 



(19) 



r 4(i-l)A+j II , 



where E,, i = 1, ...,4 are A x A matrices and P is a A x A 



permutation matrix given by 



and the j-th row of E« is given by 







1 

1 



1 

1 







Em (j, k) — ^q 4 ( l _i)A+ :) - ! h4A+4(i-l)A+fc^ 

(h 4 (i-i)A+j 



1 



En 


E12 


E13 


E14 


E21 


E22 


E23 


E24 


E31 


E32 


E33 


E34 


E41 


E42 


E43 


E44 



Proof: Let us represent the matrix E using A x A blocks 

as: 



E 



We first prove that En = Ejj for i = 2,3,4. The proof 
is by induction on the rows of the matrix En. The first row 
entries of the matrix En are given by 



En (l,fc) = (qi,h 4A+fc ) 
and for the matrix E^ are given by 



•"4(i-l)A+j II 

j-1 

X! (q4(j-l)A+m'^4(i-l)A+j^ Q4(i-l)A+m' 
m— 1 

h 4 A+4(i-l)A+fc 



1 



r 4(i-l)A+j 
1 



) 

(h4(i-l)A+j ! h4A+4(i-l)A+fc) 



J- 1 , 

X] \^4(i-l)A+m'^4(i-l)A+i 



r 4(*-l)A +J „ m=1 
Q4(i-l)A+m! h4A+4(j-l)A+fc 

tr (HA 4(? _i )A+J Aj (j _ 1)A+fc M T H T 



1 



2 || r 4(l _i )A+ j || 



2 || r 4(i _i )A+i || m=1 



X] (^m> h j) (qm: h 4A+i 



ha j a 4( ._ 1)a+1 a4 (i _ 1)a+1 a[m t h t 



Ejj (l,fc) — ^q 4 ( 4 _i) A+ i,h 4A+4 ( i _i) A+fe 

(h4(i-l)A+l, h4A+4(i-l)A+fe) 
II l"4(j-l)A+l || 

Due to the construction of the STBC, we have A 4A +; = MA;, 
for 1 = 1, ...,4A. Using this, we get 



1 i_1 

— M X! (q mJ h j) (q m > h 



4A+A-/ 



m — 1 



2ir^ (hj ' h4A+fc> 



1 

— ji X! (q m) h i)(q 



E, 4 (l,fc) 



ir- ^HA 4(4 _i )A+ iA 



4(i-l)A+fe 



M H 



2 || r 4 (j_i) A+ i || 
HA 4(i _i )A+ iA 4(i _ 1) A+i A fc MT H T 



2 r 



4(j-l)A+l 



tr f HAiA fc M H 

(q 1; h 4A+fc ) 
E u (l,fc). 



Now, let us assume that row m of Ej, is equal to the row m 
of En for all m < j. The j-th row of En is given by 



4A+fe/ 

1 i II ' — ' 

J m — 1 

= E n (j,k). 

We now prove that E2 = E21 = — E12 = E 4 3P = — E34P. 
The proofs for the matrices E3 and E 4 are very similar. First 
step is to prove that E21 = — E12. The proof is by induction 
on the rows of the matrix E21. The first row entries of the 
matrix E21 are given by 

E21 (l,fc) = (q A+ i,h 4A+fc ) 



1 ~ T ~ T ~ T 

tr (HA A+ iA A , fe M H 



En = (q j; h 4A+fe ) , 



2 II ri 

and for the matrix E12 are given by 

E12 (l,fc) = (qi,h 5A+fe ) 

= I, ij <hi,h 5A+fc ) 



Due to the construction of the STBC, we have A4A+/ = MA;, 
for 1 = 1, ...,4A. Using this, we get 



Ei 2 (l,fc) 



tr ( HAiA A , fe M H 



2 || ri 
1 



tr ( HAiA A+1 A fc M H 



HA A+ iA fc M H 



2 II ri 

= (<h+l> h 4A4-fc) 
= E 21 (l,fc). 



Now, let us assume that row m of E21 is equal to the row m 
of E12 for all m < j. The j-th row of E21 is given by 

E21 (j,k) = (q x+j ,h 4 \+k) , 

and the j-th row of Ei 2 is given by 

E12 (j,k) = (q^hsA+fc) 

_ (hj,h 5X k) - Emii (q m .hj> (q m ,h 



5A+fc/ 



2 II r A 



II II 



2 II r x+J 



- T - T - T - T 



5A+fc/ 



j-1 



1 

2 II r A+i || ^ 

1 - T ~ T ~ T 

tr (HA A+ jA fe M H 



2 II r A+ , 



1 

: ~7| X! (lA+m> h A+i) (qA+m7 h 4A+fc) 



2 


rx+j II 




1 


2 


r\+j II 




1 


2 


r A+J || 



m— 1 

(hA+j, h4A+fc) 

i-i 



= E21 (i, fc) . 

We now prove that E12 = E43P. The proof is by induction 
on the rows of the matrix Ei 2 . The first row entries of the 
matrix Ei 2 are given by 

E12 (l,k) = (q^hsA+fe) 

= 71 M (hl>h5A+fe) • 

II r l II 



Due to the construction of the STBC, we have A4A+/ = MA;, 
for 1 = 1, ...,4A. Using this, we get 

-tr (HA 1 A a+a .M H 



Ei 2 (l,fc) 



2 II ri 
1 



■tr I HA A+1 A fc M H 



2 II ri 

We need to show that this is equal to — E43 (1, A — k + 1). 

E43 (1, A - k + 1) = (q 3A+1 ,hiiA-fe+i) 
1 

= (h3A+l, huA-jt+l) 



T3A+1 

(' 



tr(HA 3 A+iA^_ fc+1 M T H T 



2 II || 

tr ( HA 3 A+iA[ A+1 A A _ fe+ iM T H T 



(' 



2 II r! || 

tr (HA 3 A + iA^ +1 A A _ fc+1 M T H T 



2 II r x || 

Substituting the values of the weight matrices from ( TPfl ) for 
A A+ i, A2 A +i and A3 A+ i, and simplifying, we see that it is 
sufficient to show that 

(ir- i (g)^ 3 )AL, +1 =,^A feJ 

or equivalently, 



lf a - 1 <^)ja 3 )Al_ k+1 Al=jaf a . 



Since A — k + 1 and k are one's complement of each other in 
the binary representation, we have, 



®a-l 



Ha 



a-1 



A A _ fc+ iA fc = Y[ oti = A A = jo 
Therefore we have, 



(if*" 1 <g)jo 3 ) A A = (if- 1 ja 3 



The equality for E3 and E4 can be shown similarly. ■ 

C. Structure of R2 

Proposition 3: The matrix R2 is block diagonal with 4 
blocks, each of size A x A. 

Proof: For the matrix R2 to be block diagonal with 4 
blocks, each of size A x A, we need to satisfy the following 
conditions 

• The matrices {MAi, MA2, MA4 A } form a four group 
decodable STBC with A variables per group 

• The matrix E is such that E T E is block diagonal with 4 
blocks, each of size A x A. 

Since the matrices {Ai, A 2 , A4 A } form a four group 
decodable STBC with A variables per group, it is 
easily seen that the matrices {MAi, MA2, MA4 A } 



also form a four group decodable STBC with A vari- 



ables per group as (MA,) (MAj) H + (MAj) (MA,) 



M 



A,A 



H 



-A, A 



H 



M 11 = for i and j in different groups. 

We now introduce some notation before we address the 
structure of the matrix E H E. Let m be an integer such that 
1 < m < A. We denote by / (rn), the binary representation 
of m — 1 using a — 1 bits. Let denote the bitwise XOR 
operation between any two binary numbers. 

Now, we turn to the structure of the matrix E. From Propo- 
sition HI we know the structure of the matrix E. Computing 
E T E, we see that for it to be block diagonal with 4 blocks, 
each of size A x A, it is sufficient to show that the matrices 
E^Ej are symmetric with identical entries on the diagonal for 
i,j = 1, 4, i 7^ j. The entries of Ej Ej are given by 



EfEj (k,l) = (q m > h 4A+fc) (q A+m ,h 4A+ ;) . 

m— 1 

Expanding and simplifying, we get 

x x 

a,nn (h m ,h4A+fc) (hA+n,ll4A+z) , 

m— 1 n— 1 

where a mn = a tl t = / _1 (/ (m) / (n)) and a t is given by 

-J2l=\ a p(qA-t+i> h A-p+i) 



a t = 



TA-t+l 



for t = 2, 3, .., A and ai = j[p^p-- We now see that for every m, 
there exists a unique m such that (h m , hi\+k) = (h m < , Iua+j) 



as 



(h m ,h 4A+fc ) = tr ( HA m A[ +fc M T H T 



tr I H 



A x+l M H 



(h m /,h 4 A+j) 



where m = / 1 (/ (m) ® f (k) ® f (I)). Similarly, for ev- 
ery n, there exists a unique n such that (hA+njlUA+j) = 
(h x+n , , h 4 A +fc ) where »' = /- 1 (/ (n) ®f(k)®f (I)). We 
can now write, 

EfEj (k, = EE a ™'«' ' h4A +') ( h A+r/ . h 4A+fc) 

m n 

= Ef E, (/, k) , 

if a m '„' = a mn . Let a TO '„- = a t /. t is given by, 
*' = f~ l (f (rn) ® / (fc) © / (0 © / (n) © / (fc) 8 / (/)) = 
/ _1 (/ ( m ) © / ( n )) = t- Therefore, we can see that Ej Ej 
is symmetric. Using the above arguments, it is also easly seen 
that the diagonal elements of the matrix Ej Ej are identical. 

Hence, we have shown that the matrix R2 is block diagonal 
with 4 blocks, each of size A x A. 



Appendix D 
Structure of R matrix obtained from 

CONSTRUCTION II 
The STBC X can be written as 

K 

X = S XjAj, 

i=l 

where Xi = xu+jxjQ. Tweaking the system model in section 
HH we can get a generator matrix for this STBC as 



G = [uec(Ai) wec(A2) 
Hence, ([T} can be written as 



vec (Ak) 



vec (Y) 



H e? x 



vec (N) , 



where H eq e C n - ntXK is given by ll' eq = (I„ t ®H)g', and 
x = [xi,x%..., Xk\ , with each Xi drawn from a 2-dimensional 
constellation. It can be easily seen that H eq = H e? . 

Let the QR decomposition of the complex matrix H eq yield 
matrices Q and R . Using the relation: If A = BC, then 
A = BC, we can see that R = R . The QR decomposition 
of a complex matrix yields a unitary Q matrix and an upper 
triangular matrix R with real diagonal entries. Hence, the 
diagonal entries of the matrix R are real. Since R = R , we'll 
have R (2i - 1, 2i) = for i = 1, ...K. Hence, the STBC X 
exhibits a block orthogonal property with parameters (K, 2,1). 

Appendix E 
Structure of R matrix obtained from 

CONSTRUCTION III 
Let the R matrix for this code have the following structure: 



R 



Ri E 
R 2 



where Ri, E and R2 are 2K x 2K matrices. 

From ETl . it can be easily seen that Ri has a block diagonal 
structure with two blocks, and each block of the size K x K. 



Ri 



Rn 

R12 



where Rn and R 42 are K x K upper triangular matrices. 

Proposition 4: The non-zero blocks of the matrix Ri are 
equal i.e., Rn = R12. 

Proof: Proof is similar to the proof of Proposition Q] ■ 

The structure of the matrix E is described in the following 
proposition. 

Proposition 5: The matrix E is of the form 



E 



Ei — E2 
E 2 Ei 



where Ei, i = 1, ...,4 are K x K matrices. 

Proof: Proof is similar to the proof of Proposition [2] ■ 
Proposition 6: The matrix R2 is block diagonal with 2 
blocks, each of size K x K. 

Proof: Proof is similar to the proof of Proposition [3] ■ 



Appendix F 
Structure of R matrix obtained from 

CONSTRUCTION IV 

As only rate-1 CIODs are considered in this construction, 
this can only be done for either 2x2 CIODs or 4 x 4 CIODs. 
The structure of the R matrix obtained from the 2x2 CIOD 
is the same as the structure of R matrix obtained from the 
construction III. The proof of the structure is also the same as 
given in Appendix [F] We now consider the structure of the R 
matrix obtained from using a 4 x 4 CIOD. Let the R matrix 
for this code have the following structure: 



where R l5 E and R 2 are 8 x 8 matrices. 

From l2ll . it can be easily seen that Ri has a block diagonal 
structure with 4 blocks, and each block of the size 2x2. 



Ri = 



Rn 











R12 











R 13 








R14 



where Ri; are 2x2 upper triangular matrices for i = 1, 4. 



Proposition 7: The non-zero blocks of the matrix Ri are 
such that Rn = R12 and R13 = R14. 

Proof: Proof is similar to the proof of Proposition Q] ■ 

The structure of the matrix E is described in the following 
proposition. 

Proposition 8: The matrix E is of the form 



E = 



Ei 


— E 2 


E, 


-E 6 


E2 


Ei 


E 6 


E 5 


E3 


-E 4 




—Eg 


E4 


E 3 


Eg 


E 7 



where Ej, i = 1, 8 are 2 x 2 matrices. 

Proof: Proof is similar to the proof of Proposition [2] ■ 
Proposition 9: The matrix R 2 is block diagonal with 2 
blocks, each of size 2x2. 

Proof: Proof is similar to the proof of Proposition [3] ■ 



